Roles of static and dynamic features of formant trajectories in the perception of talk indedivduality

نویسندگان

  • Weizhong Zhu
  • Hideki Kasuya
چکیده

Experiments were performed to investigate perceptual contributions of static and dynamic features of vocal tract characteristics to talker individuality. An ARX (Autoregressive with exogenous input) speech production model was used to extract separately voice source and vocal tract parameters from a Japanese sentence, /aoiueoie/ ("Say blue top" in English). The Discrete Cosine Transform (DCT) was applied to resolve formant trajectories of the speech signal into static and dynamic components. The perceptual contributions were quantitatively studied by systematically replacing the corresponding formant components extracted from Japanese sentences uttered by three males. Results of the experiments show that the static (average) characteristic of the vocal tract is a primary cue to talker individuality.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Iranian TOEFL iBT and the IELTS Teachers’ Views on the Structure of the TOEFL iBT and IELTS Receptive and Productive Sections in terms of Dynamic and Static Assessment

This mixed-methods design study investigated Iranian TOEFL iBT and IELTS teachers’ views on thestructure of the TOEFL iBT and IELTS receptive and productive sections from the yardsticks of dynamic and static assessment. It also examined the conformity level of the receptive and productive sections of TOEFL iBT and IELTS to dynamic assessment and static assessment standards. To achieve the objec...

متن کامل

Statistical Variation Analysis of Formant and Pitch Frequencies in Anger and Happiness Emotional Sentences in Farsi Language

Setup of an emotion recognition or emotional speech recognition system is directly related to how emotion changes the speech features. In this research, the influence of emotion on the anger and happiness was evaluated and the results were compared with the neutral speech. So the pitch frequency and the first three formant frequencies were used. The experimental results showed that there are lo...

متن کامل

3D and 4D Seismic Data Integration in Static and Dynamic Reservoir Modeling: A Review

Reservoir modeling is the process of generating numerical representations of reservoir conditions and properties on the basis of geological, geophysical, and engineering data measured on the Earth’s surface or in depth at a limited number of borehole locations. Therefore, reservoir modeling requires an incorporation of the data from a variety of sources, along with an integration of knowledge a...

متن کامل

بررسی اثر فیدبک شنوائی در تولید گفتار بعد از عمل کوکلئار ایمپلنت

The main goal of this study is to determine the auditory feedback effects in improvement of speech production process in prelingual totally deaf children who used cochlear implant prosthesis. For this reason, we recorded speech of four prelingual cochlear implant children pre and post of operation. Then we extract some static features of vowels-such as fundamental frequency, formant frequencies...

متن کامل

The Application of Tactile Experience in Urban Perception

Urban perception is the result of mutual transaction between human and environment and the process of perception is developed through the three continuous steps of “sensation”, “perception” and “cognition”. In the first step (sensorial perception), the environmental signals are received via sensorial sensors and each different sense based on its own essence, performance and ability has its own ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1997